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Abstract. We show that world trade network datasets contain empirical evidence 
that the dynamics of innovation in the world economy follows indeed the concept of 
creative destruction, as proposed by J. A. Schumpeter more than half a century ago. 
National economies can be viewed as complex, evolving systems, driven by a stream 
of appearance and disappearance of goods and services. Products appear in bursts of 
creative cascades. We find that products systematically tend to co-appear, and that 
product appearances lead to massive disappearance events of existing products in the 
following years. The opposite - disappearances followed by periods of appearances - is 
not observed. This is an empirical validation of the dominance of cascading competitive 
replacement events on the scale of national economies, i.e. creative destruction. We 
find a tendency that more complex products drive out less complex ones, i.e. progress 
has a direction. Finally we show that the growth trajectory of a country's product 
output diversity can be understood by a recently proposed evolutionary model of 
Schumpeterian economic dynamics. 
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Joseph A. Schumpeter held that the key mechanism of economic development is 
radical innovation [U|2]. In his view the Walrasian economic equilibrium is continuously 
disturbed by actions of entrepreneurs, introducing novel goods and services in the 
market. These innovations may replace existing goods and services and thereby 
impact related industries. If this happens as a cascading process it is called creative 
destruction. Current colloquial examples of how once market-dominating companies 
lose their position due to creative destruction include instant photography or printed 
newspapers in light of the age of digitalization. But is this only the driving force behind 
major shifts in industrial production and long-term business cycles, or does it also 
condition economic change on much shorter time-scales? 

This question has been addressed using firm entry and exit dates [3], job creation 
and destruction rates |4], within specific countries [5] or within specific industries [6]. 
These works capture the complex interplay between factor re-allocation and productivity 
growth in existing economic sectors. However, little is added to our understanding of 
the impact of emerging industries on the development of already established parts of 
the national economy. If a new industrial branch emerges, how does this impact other 
economic sectors? 

Traces of the creation of unprecedented industries can be observed using world 
trade data. We study the dynamics of the diversity of export products and show 
that the process through which it changes follows the patterns of creative destruction. 
The set of products a given country exports reveals the presence of the nontradable 
inputs or capabilities that the products require for their production (e.g. specific 
productive knowledge, infrastructure, legal system, labor skills, regulations, etc.) [HE]. 
In this view capabilities are elementary building blocks and each product requires a 
combination of them to be manufactured. A change in the diversity of a country's 
product basket indicates a change in its set of capabilities. New capabilities may lead 
to new products and the abandonment or substitution of already existing products 
or capabilities. Products requiring one of these abandoned capabilities as input will 
then disappear, while simple products that require only subsets of capabilities of more 
complex products may disappear as they are unable to compete for these inputs - 
creative destruction at work. 

In this article we show that typically within a country clusters of products appear 
simultaneously in bursts. We interpret such a burst as the acquisition of a novel 
capability which is required as input for each of the newly appearing products. In 
the years following creative bursts there is an increased chance that already existing 
products will cease to be exported, that is, emerging industries effectively push out 
the old ones. The novel products tend to be more sophisticated and complex than 
the disappearing products, i.e. progress has a direction. Interestingly, we do not 
observe a higher probability of product appearances following disappearance events 
of products. This means that the canonical mechanism of filling "market niches" is 
not a significant feature of the data. We confirm empirically that creative destruction, 
cascades of competitive replacement, plays an important role in the development of 
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national economies. 

We apply a recently proposed Schumpeterian diversity dynamics model [2] to 
account for the evolution of the product diversity of countries. In this model 
entrepreneurs use and combine available capabilities to create novel goods and services 
and substitute them for already existing ones. The result is a model economy in a 
self-organized critical state, characterized by creative and destructive co-evolutionary 
avalanches. The changes in product diversity and the distribution of product 
appearances and disappearances observed in world trade data are well explained by 
this model. We base our results on the World Trade Flows database compiled by the 
National Bureau for Economic Research [TO] . 

1. Results 

1.1. Diversity dynamics 

The World Trade Flows database [10J contains exports of approximately 200 countries 
over the years 1984-2000 in about 800 product categories (4-digit SITC rev.2 
classification). We include only those countries in the analysis which have a population 
of at least 1.2 million people and total exports of at least 1 billion USD, for more 
information see Text SI in the SI. We label the countries contained in this database by 
c, the product category by p and the year by t. The export values x(p, c, t), denominated 
in USD are then extracted. As a diversification measure of country c in year t the number 
of products with nonzero export values is used, that is, D c {t) = J2 P sgn(x(p, c, t)). The 
total number of countries is denoted by Nc, the number of products by Np. 

The change of diversity of product exports in world trade data over a timespan 
of 16 years is shown in Fig{TJ(a)-(c). For each country index c the diversity D c (t) 
is shown for two different years, D c (ti = 1984) in Figfjja) and D c (t 2 = 2000) 
in FigflJ^b). The net change in diversity for each country between 1984 and 2000, 
AD C = D c {t 2 = 2000) - D c (h = 1984) is shown in FigU^c). Values from trade 
data are shown in blue and are compared to results of the Schumpeterian diversity 
dynamics model (in red, to be explained below). There is a general trend towards 
increased diversity. Countries with a relatively low or high diversity in 1984 show 
smaller fluctuations in diversity than countries with intermediate initial diversity. That 
is, countries with a low diversity tend to stay poorly diversified, fully diversified countries 
stay fully diversified. In between there is a regime of transition countries, some of them 
showing explosive growth in terms of their economic diversity. We show later that these 
observations can be described through the onset of a 'creative phase transition' [2], a 
distinguishing property of Schumpeterian diversity dynamics. Let us now empirically 
investigate the process by which the countries' product export diversities change. 

Let A(p, c, t) be a product indicator function for the appearance of product p in 
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Figure 1. Countries' export product diversities for (a) 1984, (b) 2000 and the net 
change in diversity between these years (c) are shown for world trade data (blue 
lines) and the Schumpctcrian diversity dynamics model (red). There is a general 
tendency towards increased diversity which is strongest for countries having initially 
an intermediate product diversity. The results for diversity change are obtained by 
using only the subset of countries having non-zero export values in each year. The 
number of (dis)appearance events for each (d) country and (e) product are shown 
in the panel to the right for data and model. The model does not only reproduce 
the general trend of increasing diversity, but also the detailed profile of how many 
appearance and disappearance events could be observed for different countries and 
products. Note that in (a)-(c) countries are ranked by their initial diversity, whereas 
in (d) they are ranked by their number of events. 



country c between year t — 1 and t, 

J 1 if x(p,c,t - 1) < 9 and x(p,c,t) > 9 , 
A ^ c ^~ | o otherwise . (1) 

Similarly the indicator function for a product disappearance is 

J 1 if x(p, c, t - 1) > 9 and x(p, c, t) < 9 , 
D ^ C ^ = \0 otherwise , (2) 

with a threshold value set to 9 = 100,000 USD. For more details see Fig. SI FigJT](d) and 
(e) show the distribution of (dis) appearance events A (D) per country c and product 
p for world trade data and the Schumpeterian diversity dynamics model. They are far 
from being homogeneously distributed. For example, the number of appearance events 
per country varies between more than 300 and almost zero across different countries. 
Further there is a substantial number of products appearing in, say, five countries or 
less, while others appear in almost one third of all the countries studied. The model 
captures the functional form of these distributions. The number of appearances clearly 
exceeds the number of disappearances, consistent with the general trend towards higher 
diversity. 
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1.2. Co-occurrence analysis 

Creative destruction can be described as a process started by introducing novel goods or 
services in a national economy. If successful, this stimulates the market introduction of 
related goods, complimentary to the newly introduced ones. Thereby a novel cluster of 
inter-related products can form. This cluster may render existing parts of the national 
economy obsolete. If this process is actually at work within national economies, one 
would expect two empirical facts to hold. To pin down this very process empirically, 
one needs to measure two features, (i) products enter the market in creative bursts, i.e. 
not incrementally and (ii) the appearance of goods tends to foster the disappearance 
of other products. We quantify this with two different product indicators referred to 
as 'Schumpeterian Product Indicators' (SPI). The first is introduced to assess co- 
occurrences of product (dis) appearances and is denoted SPIx Y (p). Here X and Y 
stand for any combination of events A, D for product p. The second SPI measures if an 
appearance or disappearance event of products at t is correlated to an event of another 
product within the following r years and is denoted by SPI^yip)- 

The marginal appearance (disappearance) frequency Pa(p) (Pd(p)) f° r each product 
is given by 

Pa(p) = Ect A (P,c,t) , 

Pd(p) = E c , t D(p,c,t) . { > 

The number of co- appearances Paa(p,q) of products p and q across all countries and 
times is, 

PAA(jP,q)='£ t A(jp,c,t)A(q,c,t) . (4) 

c,t 

The co-occurrencdj] statistics Pad(p,q) and Pdd(p,q) are obtained by appropriately 
substituting disappearance events D(p,c,t) for A(p,c,t) in Eq. HI Obviously products 
with a relatively high number of appearances will also be more likely to co-appear. 
Consequently whenever one compares co-occurrence statistics of two different pairs of 
products one has to correct for this bias. A simple way to do this is to compare the 
number of measured co-occurrences of two products to their marginal (dis) appearance 
frequencies given in Eq. [3j For this we define the pairwise conditional co-occurrence 
measure as Paa(p, q) = max [p^'p A ( q )] • The idea is that given two products p and q we 
take the product with the higher appearance probability, say p, and measure how often 
q appears conditional on an appearance event of p in the same country. To quantify in 
how many co-occurrence events a given product participates, one can define an SPI for 
simultaneous appearances SPI\ A {p) by 

SPPaaIp) = jfY, pAA M ■ (5) 
g 



| By co-occurrence we will refer to any of the four possible pairs of events for two products p and q, 
that is (dis) appearance in p together with (dis) appearance in q. 
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where A^ is a normalization factor given by M = (Np — 1)Nq guaranteeing that the 
index lies within the range (0, 1) and is thus comparable across datasets of different sizes. 
It is straight forward to define SPIs for other pairs of events XY G {AD, DA, DD}. 

We proceed to quantify to what extent an appearance event in year t is related to 
disappearance events at a later time t' > t. Assume a maximal lag of r years between 
the two events, t < if < t + r. If not denoted otherwise, we work with a value of 
t = 3 years. A simple measure is the number of all appearances of p which are followed 
by disappearances of q within the next r years in the same country and summed over 
all countries, i.e. P A T ^(p,q) = t Ylt^=t+i A(p, c it)D(q, c, t'). This number can be 
compared to the count of appearances of product q followed by disappearances of p, i.e. 
we exchange the roles of p and q. If there an asymmetry emerges when exchanging p 
and q in P A T ^(p, q), this indicates that p appearing before q disappears is observed more 
often than the other way around. In this spirit, for each product p a time-lagged SPI 
is defined as 



This index is within the range (—1, +1). Intuitively, if one thinks of the appearance of 
p followed by disappearance of q within a country as a replacement, a positive value of 
SPI A ^(p) means that, on average, p replaces more often any other product q than p 
itself is replaced by q. The higher the SPI^bip) value for p, the higher the tendency 
that p can act as a substitute for other products. 

1.3. Products appear in bursts 

In Fig. [3] we show the maximum spanning tree for products obtained from the pairwise 
conditional co-appearance measure, Paa(p, q)- The clusters highlighted there point at 
a non-random structure in product co-appearance dynamics. To make this structure 
explicit we compare SPI AA {p) values for trade and the surrogate data, see histogram in 
Fig. Eta). A huge difference between trade and surrogate data is found. This indicates 
the presence of strong temporal correlations between individual product appearances. 
Products appear simultaneously in bursts. A way to interpret this is that the co- 
appearing products require a common capability to be manufactured. When a national 
economy acquires this capability the corresponding cluster of products can appear. In 
this sense the observation of a newly emerging cluster of inter-related products serves 
as a proxy for a country's development of a novel capability. This process is not unique 
within a given country, it can be observed in a substantial fraction of them (since the 
indicator values are averages calculated over more than one hundred countries). This 
may hint at common patterns in the diversification trajectory of individual national 
economies. 




(6) 
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Figure 2. Histograms for Schumpeterian Product Index (a) SPI AA (p) for co- 
appearances AA, and (b) SPI%l(p) for t = 3 years. SPIs are shown for the 
surrogate data (blue) and the trade data (red). The distributions for the trade data 
are significantly shifted to the right when compared to the surrogate data. The 
distribution in (a) suggests that products appear in bursts, while (b) implies that 
appearing products tend to drive other products from the market. 



1.4- Creative destruction at work 

There is a tendency for specific products to substitute or replace others. This can be 
seen as deviations in the distribution of indicator values SPI A ^ D {p) between world trade 
and surrogate data, see Fig. Efb). A high value of P^x>(p, q) — Pad(q,p) corresponds to 
the dominance of the pattern: 'product p appears and q disappears later in the same 
country' in a large number of them independently. This is a direct fingerprint of creative 
destruction at work, i.e. emerging industries pushing out the old. It is interesting to note 
that the opposite process - the disappearance of a product is followed by the appearance 
of another one - is not observed to a significant extent, see Figs. S2 and S9. For the 
DA case surrogate and data SPIs are practically identical. To measure the statistical 
significance of deviations from the surrogate data we formulate the null hypothesis that 
both surrogate and trade data are drawn from a normal distribution with the same mean. 
The p-value using the alternative hypothesis that the trade data shows a higher mean 
than the surrogate data is computed for all four possible combinations: AA, AD, DA, 
and DD. The results are listed in Tab. 1, together with p- values from various robustness 
tests as described in the SI. This series of tests confirms that the results above are not 
a consequence of trivial effects such as fluctuations in the trade records (case '# = 200k 
USD'), the artificial creation of product categories due to changes in reporting schemes 
('all products with positive exports') or the transition out of communism of former 
Soviet countries ('Excl. FSU'), see SI. Results are also robust with respect to changes 
in the time-period ('1989-2000'), the choice of the maximal lag r (V = 5','r = 7') or 
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the underlying dataset ('UN Comtrade data'). 

1.5. Progress has a direction 

Is it true that more complex products replace less complex ones? Do products produced 
by richer countries replace products associated with poorer ones? To elucidate this, 
we quantify the change in complexity and income levels associated with individual AD 
processes. The creative destruction dynamics is dominated by products with a high 
value of KI(p) or XI(q). APCI(p,q,r) and APRODY(p,q,r) are thus measured for 
each AD process with r = 3, where the appearing (disappearing) product has one of 
the hundred highest values of KI(p) (XI(q)). A positive APCI indicates an increase 
in economic complexity of the country where this AD process was observed. A positive 
APRODY indicates that the country is upgrading its exports towards those made by 
richer countries. Histograms for APCI and APRODY are shown in Fig. 0J There is a 
clear skew to the positive side for both quantities. Products with a higher tendency to 
be pushed from the market have in general lower economic complexity than products 
from appearing industrial branches. 

One could imagine that this effect is stronger in emerging economies than in 
mature ones. To test this, one can compare APCI and APRODY distributions for 
different economies by aggregating the countries into seven regions (Latin America and 
the Carribean, East-Asia and the Pacific, Middle-east and North-Africa, Sub-Saharan 
Africa, South- Asia, Western Europe and Northern America, Eastern Europe), as done 
in the SI. A clear development towards higher economic complexity can be seen for Latin 
America, Eastern Europe and East Asia & Pacific. The other regions do not display 
such a strong trend. Western Europe and Northern America are already almost fully 
diversified and stay that way, whereas e.g. Sub-Saharan Africa stays in the low diversity 
regime. These results are discussed in more detail in the SI. 

1.6. Which products are the killers? 

Let us get a clearer picture about the products driving the creative destruction process. 
Are there Learner classes whose products appear more often than others? Which types 
of products disappear instead? We consider the matrix P^ D (p,q). Each product can 
be assigned to one of ten Learner classes, labeled by k, i £ {1, ... 10}. One can obtain 
a measure for AD processes where products from class li appear and products from lj 
disappear by computing the mean value of P^(p, q) over all products from the respective 
Learner classes. To this end L(/j, lj) = j^(Pa d (p, q))peh,qeij is defined. As before, we are 
only interested in the 'net flow' of AD processes, the antisymmetric Learner transition 
matrix, U = L — L T . This is basically the same measurement strategy as discussed for 
Pad(p, q) — Pad (QiP)i however aggregated to the level of Learner classes. The matrix is 
shown in Fig. |SJ 

A positive value of H(h, lj) indicates that an appearance event of a product from 
class k followed by disappearance of a product from L, is more often observed than an 
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Petroleum Forest Products Animal Agriculture Labor Intensive (^) Machinery 

Raw Materials Tropical Agriculture (^) Cereals (^) Capital Intensive (^) Chemicals 



Figure 3. Maximum spanning tree for the network given by the co-occurrence measure 
Paa(p, q)- Some of the clusters of co- appearing products are highlighted as guides to 
the eye. It is suggested that these clusters of products require a common capability 
before they can be exported. Once a country acquires all of these capabilities, a creative 
burst of novel products which require them as input may be the consequence. 
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Figure 4. Histograms for change in (a) income level PRODY and (b) product 
complexity PCI associated with individual creative destruction processes. There is 
a skew to the positive side in both of them, indicating that national economies are 
typically restructured into the direction of more complex and higher income products. 



appearance from lj followed by disappearance in Zj. Given that AD processes convey 
information about how national economies are re-structured over time, positive values 
in II indicate which parts of the economy are abandoned or disappear because of which 
other parts. There is a clear trend. Capital and labor intensive products, as well as 
machinery and chemicals appear much more often than they disappear. Agricultural 
products and cereals tend to disappear. This intuitively confirms that markets re- 
structure themselves towards higher economic complexity as observed in Fig. HI Note 
that these observations can not be described by diffusion or migration processes of e.g. 
production facilities or capabilities from one country to another. Each measurement of 
appearances followed by disappearances takes place within a single country. A product 
appears within a country and another product disappears later within the same country. 



1.7. A simple model 

A recently introduced Schumpeterian diversity dynamics model [2j centers around the 
assumption that countries have an evolving set of capabilities which firms combine 
to manufacture products, see SI. The economic state of a country is represented 
by two (high-dimensional) binary vectors, one indicating whether a country has a 
given capability, the other indicating whether a country exports a given product. 
These vectors evolve over time in each country. Capabilities can be acquired through 
entrepreneurs who combine existing capabilities. For example, an existing production 
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Figure 5. Transition matrix II between Learner classes. For each pair of classes 
{h,lj) it is measured how often an appearance event of a product from class £j is 
observed followed by a disappearance of a product from class lj, compared to the 
opposite direction. Rows are indexed by the appearing Learner classes U, columns 
by disappearing ones lj. A positive value indicates an excess in creative destruction 
processes between the two classes (from row index to column index). The matrix is 
by construction anti-symmetric. There is a clear tendency of appearing labor and 
capital intensive products, as well as machinery and chemicals. Cereals or agricultural 
products tend to disappear. 

infrastructure and a certain type of knowledge stock can be combined to acquire an 
upgraded production facility. This scheme is shown in Fig. [6j where capabilities are 
represented by blue squares. In this case capability k is a combination (blue ellipse) of 
i and j. The model capability assumes that if a country has capability i and j, it will 
acquire k in the next time-step. 

Each product requires a set of capabilities in order to be produced. In the model 
a country exports a product if it has all necessary capabilities. The simplest possible 
case (one capability needed for one product) is shown in Fig. |6] where products are 
represented by red circles. Product p requires capability i, product q requires k. As 
soon as a country acquires k, it will start to report exports in q. Since one capability 
may be required for more than one product, all of these products may co-appear with 
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Capabilities can also be lost or abandoned. There is a chance that q may act as 
substitute for p. In this case the acquisition of capability k renders % obsolete. With 
some probability p~ capability i is thus abandoned or destroyed because of k. As a 
consequence p will then no longer be exported. 

In the model it is further assumed that the rules of how capabilities can be 
combined, substituted and used as inputs for products are identical in each country. 
Economies only differ by their initial diversity of products (from which the implied 
diversity of capabilities is calculated). The model is iterated until the number of model 
(dis) appearances matches the number of events observed in the trade data. Fig. [1] shows 
a comparison of (a) initial and (b) final product diversity, (c) net diversity change and 
the number of appearances for each (d) country and (e) product for model and trade 
data. The complex observed patterns of creative destruction can be explained by the 
almost embarrassingly simple process of recombining and substituting capabilities. The 
crucial feature of the model is the existence of a creative phase transition with a position 
depending on the initial diversity [T5l [16]. If it is below a certain threshold, there are 
not enough capabilities available to find successful combinations of them. If a country 
is at, or above this threshold the creative destruction process kicks in, restructuring the 
market [2]. For a complete model specification, including dynamical algorithm, see the 
SI. 

2. Discussion 

The main novelty of the approach in this article is to identify, in a large sample of 
countries, the existence of a systematic relationship between the appearance of new 
industries and the appearance or disappearance of other industries within the same 

Table 1. p- values for SPI X y histograms from trade vs. surrogate data. The column 
V = 3' lists the results as described in the main text. In addition, robustness tests 
where conducted with results listed in separate columns. '9 = = 200k USD' uses the 
same set of countries and a threshold of 200.000 USD below which trade flows are 
ignored. In 'all products with positive exports' all products are included which have 
positive world exports in each year of the analysis. The column '1989-2000' decreases 
the number of years included in analysis. Results excluding the FSU and CEE are 
listed in 'Excl. FSU'. The maximal lag r is then increased to r = 5,7. The last 
column reports results using the UN ComTrade dataset, as described in the SI. 
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Figure 6. Illustration of the Schumpeterian diversity dynamics model. Capabilities 
are represented by blue squares, products by red circles. A country can acquire 
capability k by combining other capabilities i and j. Each product requires a set 
of capabilities to be produced, e.g. product p requires i and q requires k. There is a 
chance that a novel product q may act as a substitute for a product p which is made 
from a subset of capabilities required for k. In this case k may effectively 'destroy' its 
preceding capability i. 

country. In particular our analysis reveals that (i) products appear cluster-wise in 
creative bursts which consequently (ii) increases the chance for other, already existing 
products to be pushed from the market and (hi) the emerging products are typically 
associated with higher income and a higher level of economic complexity. The effect of 
shifting production towards higher complexity is strongest in the developing countries 
of Latin America, Eastern Europe and East Asia and Pacific and barely visible in the 
least developed countries or Europe and North America (see SI). 

These observations can be explained within an evolutionary model of Schumpeterian 
economic dynamics [2 J . Just as the product portfolio of countries can be explained as a 
consequence of the presence of productive capabilities in that country [8], changes in a 
country's product portfolio can be understood through the evolution of its capabilities. 
Entrepreneurs (firms) upgrade and recombine existing capabilities to create innovations. 
These may substitute existing industries, restructuring the market. Such a system can 
exist in three different modes [15} [16]. First, if the initial number of capabilities is 
too low to allow for a sufficient number of novel combinations, the creative destruction 
process may not set in at all. Second, if the country is already fully diversified progress 
is also slowed down. In between these regimes there is a third, transition regime, where 
the creative destruction process is most effective. We calibrate the model with initial 
product diversities and (dis) appearance rates observed through world trade data, and 
can reproduce (i) the position of the transition regime from low to high diversity, (ii) 
the patterns of (dis) appearance frequencies per country and (iii) the distribution of 
(dis) appearance frequencies for individual products. 
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Beyond finding evidence of creative destruction in a large set of national economies, 
this paper also helps to further reveal the complex topology of industry relatedness. 
Understanding 

3. Materials and methods 

3.1. Surrogate data 

For the purpose of statistical analysis we construct a surrogate dataset with the aim to 
destroy the correlations in the timing of (dis) appearances while the event statistics 
(as shown in FiglT] (d) and (e)) are preserved. The surrogate data is prepared as 
follows. Each event is given by a triplet (pi, Ci, ti) where index i runs over all appearance 
(disappearance) events in the data. For each i we fix p^ and c« while shuffling the 
years ti between the events in the triplets. Formally this defines a random permutation 
V : {i} i — y {i} over the set of all event indexes i. One may then calculate the co- 
occurrence measures from the triplet (pi,Ci,V(ti)) and average the result over many 
realizations of V. This ensures that the marginal distributions of the events (Fig. [1] 
(d) and (e)) remain unchanged while all correlations in the timing of appearances or 
disappearances are destroyed. 

3.2. Network 

The creative bursts of cluster-wise co-appearing products can be effectively displayed 
as a network. The pairwise conditional co-appearance measure, Paa(p,q), allows to 
construct similarity matrices for product categories from which the maximum spanning 
tree, shown in Fig. [3j is constructed (see SI). Nodes in the network represent product 
categories according to the SITC rev. 2 4-digit classification. The color of the nodes 
represents the products' Learner classes p2]. This is a classification based on relative 
factor intensities such as amount of capital, labor, land, skills etc. A similar route has 
been followed to construct the product space [9]. Fig. [3] suggests that the clusters of co- 
appearing products require one or several common capabilities. Once a country acquires 
or upgrades these capabilities, it starts to report exports in these product categories. 
Their appearances are simultaneously observed as a creative burst within the same 
Learner class. 

3.3. Product Indicators 

Product Income Indicator PRODY. This indicator for a given product is the weighted 
average of the per capita GDPs of countries exporting it. It is a weighted average of 
the income per capita of the countries that have revealed comparative advantage in 
that product [13]. A more sophisticated product, in principle, should be made by 
richer countries. Let us define the difference in PRODY for products p and q as 
APRODY(p,q,r) = PRODY(p\ appears at t) - PRODY(q\ disappears at f) with 
t < t' < t + r. 
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Product Complexity Indicator. The Product Complexity Index PCI is an indicator 
for the economic complexity involved in manufacturing a given product [H] . The PCI 
is a combination of the ubiquity of a product (i.e. the number of countries that make it) 
and the economic diversity of its exporting countries, see SI for more details. Products 
with high PCI typically are made by few highly diversified countries which is indicative 
of high economic complexity j7]. The change in complexity APCI(p, q, r) between two 
products is given by APCI(p, q, r) = PCI{p\ appears at t) — PCI(q\ disappears at t'), 

t<t' <t + T. 

Killer Index. We refer to KI(p) = SPI^ D {p) as the 'Killer Index' for each product 
p. It measures the likelihood that the appearance of a given product p is observed with 
the disappearance of any other product in the next r years. 

Extinction Index. The likelihood that a product disappears together with the 
appearance of any other product r years earlier is expressed by its 'Extinction Index' 
XI{q) = E p Pad(p,Q) - P { lkl,P)- We set r = 3. 
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5. Description of Datasets 

We use two independent data sets in this work. 

• NBER trade data [10J. Compiled by the National Bureau of Economic Research, 
this set of bilateral trade data by commodity spans the period 1962-2000. Trade 
flows (in USD) are reported in product categories following 4-digit SITC rev. 2 
classification. This dataset is a combination of two others, spanning 1962-1983 
and 1984-2000 respectively. We work with the timespan 1984-2000 to exclude any 
possible artifacts in the results due to changes in data collection between these 
two timespans. The NBER trade data introduces artificial product categories 
(containing 'A's and 'X's in the SITC code) to account for differences in import 
and export records (i.e. if country A exports to countries B,C, but A's export 
record deviates from (B+C)'s import records). We only focus on export data and 
exclude these artificial product categories. Finally, we only include 'real' countries 
(the dataset also lists world regions, such as Southern Asia or Oceania, etc.). This 
results in longitudinal trade data for 200 countries in 800 product categories over 
17 years. 

• COMTRADE trade data [11] • The United Nations Commodity Trade Statistics 
Database (UN COMTRADE) publishes annual international trade statistics data 
by commodities and partner countries. We use data from the timespan 1990-2010. 
Export values (in USD) are reported in HS1992 product categories for over 170 
countries (again, leaving aside world regions), amounting to roughly 5000 categories 
over 21 years. 

Let A(p, c, t) be a product indicator function for the appearance of product p in 
country c between year t — 1 and t, 



Similarly the indicator function for a disappearance event is 

J 1 if x(p, c, t - 1) > and x(p, c, t) = , 
^' C ' t) = {o otherwise . (8) 

Note that these definitions are only useful if there exists a data record for c at both 
t and t—1. We exclude small countries from the analysis by demanding a population of 
at least 1.2 million people and total exports of at least 1 billion USD, leaving us with a 
list of 125 countries. The reported results for the SPI where computed over the timespan 
1984-2000. Individual trade flows between countries are only included if they exceed 




1 if x(p, c, t — 1) = and x(p, c, t) > 
otherwise . 



(7) 
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100000 USD Furthermore, appearance and disappearance events are not included if one 
of the following is true. 

• In the year before product p appears (or after p disappears) in a country c, the 
country reports positive exports in less than fifty different categories. 

• The diversity time-series sgn(x(p, c, t)) of the product does not belong to one of the 
four types shown in Fig. £[TJ 

After these filtering procedures we assign each product a appearance or disappearance 
event (or no event at all). If existing, each product in each country is assigned a unique 
appearance (disappearance) event which is its first (last) measured event. For this 
particular event we set A(p,c,t) = 1 (D(p,c,t) = 1) and zero at each other point, in 
Fig. 31] appearance events are highlighted by green slashed lines, disappearances by red 
dashed lines. 

6. Robustness 

Several types of robustness checks are conducted. The analysis is repeated with a 
threshold of 200,000 USD for export values, that is whenever x(p,c,t) < 200000 we set 
x(p, c, t) = 0. The results remain basically unaltered and are thus not driven by small, 
fluctuating export values, see column 'threshold' in Tab. 1. The next check concerns 
artificial product appearances / disappearances due to the creation or abandonment 
of product categories or countries. To this end only product categories are included 
in the analysis which have positive world exports in each year. This does not affect 
the analysis, see column 'all products with positive exports'. Thirdly the influence of 
the time-span on the results was checked. The time-span was reduced to 1989-2000, 
see 'cleaned data 1989-2000'. Former communist transition countries from the FSU are 
excluded in the column 'Excl. FSU'. Then the maximal time lag is varied to r = 5 
and r = 7 years respectively. Finally the same analysis is carried out with the UN 
ComTrade data. 



(a) (b) (c) (d) 




t t t t 



Figure 1. Appearance and disappearance events in product p in country c are only 
included if the underlying diversity timeseries sgn(x(p, c, t)) is of one of the types (a)- 
(d). The time of the appearance event is then always the first appearance (green 
dashed line in (a) and (b)), disappearances are the last events (red dashed lines in (c) 
and (d)). 



Empirical confirmation of creative destruction from world trade data 



19 




Figure 2. Histograms for Schumpeterian Product Index SPI^ Y {p) for co-appearances 
A A, co-disappearances DD, and the mixed forms AD and DA (from top to bottom) 
for maximal time-lag r = 3. The SPI is shown for the surrogate data (blue) and the 
trade data (red). A significant fraction of higher SPI values for the trade data are seen 
for appearance-disappearances. 

7. Maximum Spanning Tree 

The MST is the network with the least number of links with the highest possible weights 
spanning each node of the network. It can be computed using Kruskal's algorithm. The 
MST can be regarded as the 'skeleton' or 'backbone' of a network. It positions each 
node in the neighborhood of other nodes to which it has strong ties and is therefore 
reminiscent of hierarchical clustering procedures. 

8. Result for COMTRADE database 

All results reported in the main text can also be found in the COMTRADE database. 
We have chosen to work with the NBER data for the following reasons. The product 
classification employed for COMTRADE has gone through several revisions, namely 
HS1992, HS1996 and HS2002. Each of these revisions causes artificial appearances 
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PCI 

Figure 3. Comparison of PRODY and PCI values for each product category. 

and disappearances (due to re-classification of products). At the time the dataset was 
extracted a substantial amount of trade data was still to be reported causing artificial 
disappearances, thus one can only effectively work with the range 2000-2009. The NBER 
database, on the other hand, is the outcome a collaborative research effort to provide 
a coherent dataset. For example, if a country only reports its trade flows in 3-digit 
S1TC or an older revision, this is painstakingly checked against the reports of the trade 
partners and records from the US trade database to 'fill the gaps'. This reduces the 
number of artificial appearances and disappearances. A detailed description of how the 
NBER data was composed can be found in [10]. p- Values for the comparison of trade 
and surrogate data are shown in Tab. 1, column 'COMTRADE' 

9. Further results on product comparisons 

The PRODY value was computed as in [13] using NBER trade data as an average over 
each PRODY from the timespan 1993-2000. To compute the PCI we extract the matrix 
of significant exporters M cp as defined in [7] from the NBER trade data for each year 
from 1993-2000. Using k Cj0 = J2 P M cp and /c Pi o = M cp one computes the matrix 
Mpq = J2 C T^FV • ^ ne PCI is the average over all years of the eigenvector associated 
with the second largest eigenvalue of M pq . Figj3]shows how these two indexes are related. 

10. Further results on country comparisons 

Not each region in the world shows the same pattern of development towards higher 
economic complexity. We repeat the analysis of complexity change associated with 
AD processes for countries from Latin America and the Carribean, East-Asia and 
the Pacific, Middle-east and North-Africa, Sub-saharan Africa, South-Asia, Western 
Europe and Northern America, as well as Eastern Europe separately. That is, we use 
only countries from the respective regions, compute SPI^ D {p) for these countries and 
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Figure 4. Histograms for change in (top row) income level PRODY and (bottom 
row) product complexity PCI associated with each AD process for seven different 
world regions (Latin America and the Carribean, East-Asia and the Pacific, Middle- 
east and North- Africa, Sub-saharan Africa, South- Asia, Western Europe and Northern 
America, Eastern Europe) . There is a skew to the positive side most clearly visible for 
Latin America and the Carribean, as well as for CEE countries. 



compute APRODY and APCI as described in the main text, see Fig. SHJ Transitions 
towards higher economic complexity (skew to the right in the histograms) is clearly 
visible for Latin America and Eastern Europe. In other regions this trend is less visible 
or even absent. This supports the notion of 'critical economies' where Schumpeterian 
economic evolution is successful at work. These critical economies can be identified as 
transition countries. For low diversity countries (compare e.g. Sub-saharan Africa) there 
is simply 'too little to start with' in order to move towards higher economic complexity, 
with respect to this they can be called 'sub-critical'. Fully diversified countries (compare 
Western Europe and North America) also defy this trend. Since they are already at the 
top of the 'complexity scale', there is not much room for improvement here. This may be 
due to an too coarse resolution of the standard product classification. These countries 
can be dubbed supra-critical. 

Eastern European countries have undergone a major shift in their political regimes 
in the timespan we use for the measurement. One may wonder whether the shift 
towards higher economic complexity is due to developments under the centrally planned 
economy before 1990 or afterwards. The analysis is repeated for these countries using 
the timespan 1984-1989, see Figs. SEJa) and (b). Compare these histograms to the ones 
observed between 1990-2000 in Figs. S^c) and (d). There is no clear movement towards 
higher complexity before 1990. This suggests that the move towards higher complexity 
observed later is due to the shift from a centrally planned to a free market economy. 

In analogy to the SPI a Schumpeterian Country Index SCI is defined for AD and 
r = 3 as 

SCI(c) = ( A (P>c> t)D(q,c, if) -A(q,c, t)D(p,c,t')) t < if < t+r .(9) 

p,q,t,t' 
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Figure 5. Histograms for change in income level PRODY and (a) product complexity 
PCI (b) associated with each AD process for Eastern European countries between 
1984-1989, that is in a centrally planned economy. The development is more or less 
symmetric. Compare this to the clear positive skew in (c) and (d), where changes in 
PRODY and PCI for the same countries between 1990-2000 are shown. This suggests 
that the increase in economy complexity observed there is due to the shift from planned 
to market economy. 



We show that the SCI measure is not an artifact of the number of appearances 
alone in Fig. SE1 where the SCI for each country is plotted against the number of 
appearances in this country. Countries like Portugal and Hongkong have one of the 
lowest ranking SCIs, Ecuador and Lithuania one of the highest ranking ones. But each 
of these countries have rougly the same number of product appearances. 

Countries with lowest ranking SCI values are countries which are either already 
fully diversified and show thus only little activity in terms of (dis) appearances (USA, 
Germany, France, China, UK, ...) or countries with constant low diversities (Malawi, 
Gabon, Uganda, ...). Countries with a relative high SCI include Uruguay, Kuwait, 
Croatia, ... and tend to be bridge countries between these two regimes. Fig. S3 shows 
countries and their SCI and Gross Domestic Product (GDP) per capita of 2000. Western 
countries are mostly found at the bottom right (low SCI and high GDP), poor countries 
to the bottom left (low SCI and low GDP), countries with high SCI form a bridge 
between those two regimes. 

The virtues of having or not having a product are shown in Fig. £[8j For each 
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Figure 6. Number of appearances vs SCI for each country. This shows that SCI can 
not be explained by the number of appearances alone. Observe that e.g. Ecuador and 
Portugal have the same number of product appearances, but Ecuador has one of the 
highest SCIs and Portugal one of the lowest. 
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Figure 7. GDP vs SCI for each country. Countries cluster into two distinct groups. 
Western countries are mostly found at the bottom right (low SCI and high GDP), poor 
countries to the bottom left (low SCI and low GDP), countries with high SCI form a 
bridge between those two regimes. 
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Figure 8. GDP vs average PRODY of products not being exported by this country. 
High GDP countries tend to not export products with a low PRODY, i.e. low economic 
complexity (upper left region of the plot). Low GDP countries miss products with a 
high PRODY in their export baskets, they are found in the lower right region. Oil 
exporting countries such as Qatar and the United Arabian Emirates deviate from 
this trend. They achieve high GDP per capita values with significantly less complex 
products. 



country we compute the average PRODY value of products which are not exported by 
this country as of 2000. High GDP countries (compare Canada, Austria, USA, Norway, 
Sweden, ...) do not export products which have on average a significant lower PRODY 
than developing and least developed countries. There are some outliers, for example oil 
exporting countries achieving a high GDP per capita with low complexity products. 

Since the transition countries from the Former Soviet Union are main contributors 
to the AD process, as can be observed from the previous results shown in Fig. 3U Fig. 
Sj6] and Fig. S0 it is worth to show that the significance of the AD results is not due 
to the contributions of these countries alone. To this end we compute histograms for 
SPIxyip) f° r a se t of countries excluding the FSU and Eastern Europe, see Fig. S2] 
and the respective column of p- Values in Tab. 1 entitled 'Excl. FSU'. 

11. Schumpeterian Evolutionary Model 

The occurrence of bursts of creation (product appearance) and destruction (product 
disappearance) is a prominent feature of Schumpeterian economic dynamics j2]. We 
show that this model can be used to describe the development of national economies in 
terms of product (dis) appearances as Schumpeterian 'gales of destruction'. The model 
is a variant of [2] with some features proposed by [7] . The components of the model are 
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Figure 9. Histograms for Schumpeterian Product Index SPlZyip) f° r co-appearances 
A A, co-disappearances DD, and the mixed forms AD and DA (from top to bottom) 
for maximal time-lag r = 3. Countries from the Former Soviet Union and Eastern 
Europe are excluded. The SPI is shown for the surrogate data (blue) and the trade 
data (red) . A significant fraction of higher SPI values for the trade data are seen for 
appearance-disappearances. 



now described in detail. 



11.0.1. Model Components Products. For each country c the binary product vector 
7r c ,p(t) records whether product p is currently produced (ir c ,p{t) = 1) or not (7T CjP (i) = 0). 
To produce p it is required to possess a specific set of capabilities. 

Capabilities. A country's product portfolio is determined by the available 
nontradable capabilities residing in it [7]. Each country c's set of capabilities is described 
be a time-dependent vector a c (t) with binary components. The z'th entry in a c (t) 
indicates whether the capability % is available in country c at time t (cr c> j(t) = 1) or is 
not available (cr c ^(t) = 0). The total number of capabilities which can be acquired is 
fixed to some Na 3> 1- 

Productions - Capability Gain. Capabilities are acquired through a novel 
combination of already present capabilities. This is encoded in the production rule 
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table ct\- k . If capability i can in principle be acquired through a combination of j and k 
then a\- k = 1, otherwise a\- k = 0. Note that each the production rule table is the same 
for each country. The production process is then given by 

a c ^{t + l) = at jk <T c j(t)a c>k (t) . (10) 

There are on average r + unique and different production processes for each capability 
i, with j and k chosen at random from the entire set of capabilities. 

Product disappearance. If a product requires a set of capabilities which is a 
subset of capabilities required to produce another product, it may disappear. The idea 
is that the product with the larger set of capabilities is a more complex and advanced 
product substituting its antecedent (e.g. wikipedia replacing an encyclopedia). Assume 
product p requiring capability i and product q requires capability j. If there is a 
production rule a\- k then product p requires capability j and k and is thus an improved 
version of product q. We incorporate this as competitive replacement in the model by 
assuming that with probability p~ a production rule for i leads to a negative influence 
on j. For this the destruction rule table ct~- is introduced. It encodes whether capability 
% leads to a substitution for j (a~j = 1) or not (a~j = 0). The destruction process is 
given by 

<T c , i (t + l) = l- a r j <T C j(t) . (11) 

The destruction rule table is ubiquitous for each country. 

Migration of Capabilities. In addition to the deterministic production and 
substitution processes there is a stochastic migration of capabilities among the countries. 
With probability p a capability migrates from a country c\ to another, randomly chosen 
country C2- If o'c 1 ,i(t) = 1 we set (T Cl j(t) = and o~ C2j i(t) = 1 with probability p. 

From Capabilities to Products. Each product needs n a different capabilities 
as input. This is accounted for in the capability x product matrix Mj P . If product 
p requires capability % then M ip = 1, otherwise M ip = 0. Thus each column has n a 
randomly distributed nonzero entries. In each timestep it is checked for each product 
whether its set of required capabilities is present. Thus 7r CjP (£) = 1 if and only if 
Z)i AfipO^f) = n a . 

11.0.2. Dynamical Algorithm The Schumpeterian dynamical algorithm for this model 
is given by: 

• Pick a country c at random. 

• For each capability i with a C;i (t) = 1, set cr Cji (t) = and <7 c ',j(0 = 1 with probability 
p (where d is another randomly chosen country). 

• Pick a capability % at random. 

• Sum all productive and destructive influences on i, that is compute Aj(t) = 
£; lfc a,V^(*K*(*) " ^WcM If Ai(t) > (A t (t) < 0) then a c ,(t + 1) = 1 
(<T c>i {t + 1) = 0), otherwise a Cji (t + 1) = a Cji (t). 
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• Continue picking capabilities until all capabilities have been updated, then go to 
the next country. 

• Once all countries have been updated, update the product vectors 7r C)P (t) and go to 
the next timestep. 

The model is initialized with a number of countries N c and products N p taken from 
the data. As further model input serves the number of products that each country 
exports at the initial time-step t — (corresponding to 1984 for the data). Let this 
diversity be D c (0). We set cr c ,i(0) = 1 with probability (D c (0)/N p ) 1 / na . This guarantees 
that the initial model diversity ^ p 7r c ,p(0) equals D c (0). After initialization, the model 
is iterated over T' time-steps. Assume that we compare the model to an empirical 
timeseries over T years. The resulting model trajectories are segmented into T segments 
of equal length ('model years'). The value of T' is chosen such that the absolute number 
of appearance and disappearance events is the same in model and real world data. This 
rescaled model data is now called x(p, c, t) and can be compared to the real world data 
x(p,c,t). We will denote model entities by a bar, e.g. A(p, c, t) denotes appearance 
events obtained from x(p, c, t) via the same procedure as for x(p, c, t). 

The results of a comparison between the Schumpeterian diversity dynamics model 
and world trade data are summarized in Fig. HJ Here the model parameter r + = 1.65, 
p~ = 0.15, p = 0.002, Na = 100, n a = 2. All other model parameters can be measured 
in the data. In Figs|IJa),(b) the initial and final product diversities for each country 
are shown. In both cases the countries are ranked by diversity. Per calibration the 
initial diversities are equal. The final diversities (b) also overlap to a large extent. A 
more detailed view of this is given in Fig. HJc). Here the increase in product exports for 
countries of a given rank is shown. A remarkable peak is visible in both model and trade 
data. In the model this is explained by the onset of a creative phase transition [T5].|16j. 
As a country gradually acquires novel capabilities it eventually reaches a point where 
it has accumulated enough critical capabilities to diffuse into the entire product space. 
Creative bursts of diversification are the consequence. Countries below this critical point 
experience stagnation, whereas countries above this threshold reach a fully diversified 
plateau. 

The results reported are independent of Na [15]. The variable p merely sets the 
time-scale for T' . The parameters r + and p~ determine the number of capabilities 
required for creative bursts to set in, that is the phase transition to a fully diversified 
country. Changing these values rescales the peak in Figfjjd). The main conclusions 
however remain intact. 



